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In this paper, we consider the transition complexity of regular languages based on the incomplete 
deterministic finite automata. A number of results on Boolean operations have been obtained. It 
is shown that the transition complexity results for union and complementation are very different 
from the state complexity results for the same operations. However, for intersection, the transition 
complexity result is similar to that of state complexity. 

1 Introduction 

Many results have been obtained in recent years on the state complexity of individual and combined 
operations of regular languages and a number of sub-families of regular languages ll5l[8l [TTllT2l[T5l[T6ll . 
The study of state complexity has been mostly based on the model of complete deterministic finite 
automata (DFAs). When the alphabet is fixed, the number of states of a complete DFA determines the 
number of transitions of the DFA. Note that a description of a DFA consists of a list of transitions, which 
determine the size of the DFA. Incomplete DFAs are implied in many publications (HO- In quite a 
number of applications of finite automata, incomplete rather than complete DFAs are more suitable for 
those applications HI [13]]. For example, in natural language and speech processing, the input alphabet 
of a DFA commonly includes at least all the ASCII symbols or the UNICODE symbols. However, the 
number of useful transitions from each state is usually much smaller than the size of the whole alphabet, 
which may include only a few symbols [9]. Although the state complexity of such an incomplete DFA 
can still give a rough estimate of the size of the DFA, the number of transitions would give a more precise 
measurement of its size. 

In this paper, we consider the descriptional complexity measure that counts the number of transi- 
tions in an incomplete DFA. It is clear that for two DFAs with an equal number of states, the size of 
the description may be much smaller for a DFA where many transitions are undefined. Especially, for 
applications that use very large or possibly non-constant alphabets, or DFAs with most transitions un- 
defined, it can be argued that transition complexity is a more accurate descriptional complexity measure 
than state complexity. Before this paper, transition complexity was investigated only on nondeterministic 
finite automata [3l|4][6]|7]] and Watson-Crick finite automata iflOll . 

We consider operational transition complexity of Boolean operations. The transition complexity re- 
sults for union and complementation turn out to be essentially different from the known state complexity 
results lfT6l[T7l that deal with complete DFAs. Perhaps, as expected, the results for intersection are more 
similar with the state complexity results. For union we have upper and lower bounds that differ, roughly, 
by a multiplicative constant of 2. We conjecture that worst-case examples for transition complexity of 
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union need to be based on two DFAs that for some symbol of the alphabet have all transitions defined. 
For DFAs of this type we have a tight transition complexity bound for union. 

We can note that for union the state complexity results are also different for complete and incomplete 
DFAs, respectively. When dealing with incomplete DFAs the state complexity of the union of an n\ state 
and an n% state language is in the worst case n\ • ni +n\ + «2- 

2 Preliminaries 

In the following, £ denotes a finite alphabet, £* is the set of strings over £ and e is the empty string. A 
language is any subset of £*. When £ is known, the complement of a language L C £* is denoted as 
L c = £*-L. 

A deterministic finite automaton (DFA) is a tuple A = (£, Q,qo,F, 8) where £ is the input alphabet, 
Q is the finite set of states, F C Q is the set of accepting states and the transition function 8 is a partial 
function Q x £ — > Q. The transition function is extended in the usual way to a (partial) function 8 : 
Q x £* — > Q and also 8 is denoted simply by 8. The language recognized by A is L{A) = {w G £* | 
S(q ,w) G F }. 

Unless otherwise mentioned, by a DFA we mean always an incomplete DFA, that is, some transitions 
may be undefined. For more knowledge in incomplete automata, the reader may refer to [1]. The state 
complexity of a regular language L, sc(L), is the number of states of the minimal incomplete DFA 
recognizing L. 

The Myhill-Nerode right congruence of a regular language L is denoted =l ifTTl . The number of 
equivalence classes of =l is equal to sc(L) if the minimal DFA for L has no undefined transitions, 
sc(L) + 1 otherwise. 

If A = (£, Q,qo,F,8) is as above, the number of transitions of A is the cardinality of the domain of 
8, |dom(5)|. In the following the number of transitions of A is denoted # tr (A). We note that if A is 
connected (that is, all states are reachable from the start state), then 

iei-i<#tr(A)<|£|-ie|. (i) 

For b G £, the number of transitions labeled by b in A is denoted # tr (A,b). 

The transition complexity of a regular language L, tc(L), is the minimal number of transitions of any 
DFA recognizing L. In constructions establishing bounds for the number of transitions it is sometimes 
useful to restrict consideration to transitions corresponding to a particular alphabet symbol and we intro- 
duce the following notation. For b G £, the b-transition complexity of L, tc^(L) is the minimal number 
of ^-transitions of any DFA recognizing L. The following lemma establishes that for any b G £, the state 
minimal DFA for L has the minimal number of ^-transitions of any DFA recognizing L. 

Lemma 1 Suppose that A = (L,Q,qo,F,8) is the state minimal DFA for a language L. For any b G £, 

tc b (L)=# a (A,b). 

Since the result is expected, we omit the proof. Lemma Q] means, in particular, that for any given 
b G £ we cannot reduce the number of ^-transitions by introducing additional states or transitions for 
other input symbols. From Lemma[T]it follows that 

tc(L) = £tc,(L). 

bel. 

As a corollary of Lemma Q] we have also: 
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Corollary 1 Let A be the minimal DFA for a language L. For any b € E, the number of undefined 
b-transitions in A is sc(L) — tc b (L). 

To conclude this section, we give a formal asymptotic definition of the transition complexity of an 
operation on regular languages. Let be an m-ary operation, m > 1, on languages and let / : IN" 1 — > IN. 
We say that the transition complexity of is / if 

(a) for all regular languages L\,...,L m , 

tc(Q(L l ,...,L m ))<f(Xc(L 1 ),...,Xc( L m)), (2) 

(b) for any (ni,... ,n m ) G !N m there exist n\ > n,-, i = 1, . .. ,m, and regular languages L; with tc(L,-) = n\, 

i = 1, . . . , m, such that the equality holds in ©. 

The above definition requires that there exist worst case examples with arbitrarily large transition com- 
plexity matching the upper bound, however, we do not require that matching lower bound examples 
exist where the argument languages have transition complexity exactly for all positive integers n,-, 
i = 1 , . . . , m. 

3 Transition complexity of union 

We first give bounds for the number of transitions corresponding to a particular input symbol b € L. These 
bounds will be used in the next subsection to develop upper bounds for the total number of transitions 
needed to recognize the union of two languages. 

3.1 Number of transitions corresponding to a fixed symbol 

The upper bound for the ^-transition complexity of union of languages L\ and L 2 depends also on the 
number of states of the minimal DFAs for L,, i = 1,2, for which a ^-transition is not defined. From 
Corollary Q] we recall that this quantity equals to sc(L,) — tc b (Li). 

Lemma 2 Suppose that £ has at least two symbols and L\, L2 are regular languages over £. For any 

tc b (L 1 UL 2 ) < tc b (L 1 )-tc b {L 2 ) + 

tc b (Li ) ( 1 + sc(L 2 ) - tc b {La)) + tc h (La) (1 + sc(Li ) - Xc b (Li )) . (3) 

If n\,ri2 > 2 are relatively prime, for any 1 < kj < i = 1,2, there exist regular languages Li with 
sc(L,) = and tc^(L ( ) = kj, i = 1,2, such that the inequality (12) is an equality. 

Proof. Consider regular languages L ( - and let A, = (£, Qi,qo.i,Fj, 5,-) be a DFA recognizing L,-, i = 1,2. 
From A\ and A 2 we obtain a DFA for L\ UL 2 using the well-known cross-product construction ifTTl 
modified to the case of incomplete automata. We define 

qi _ [ Qi U {d} if Aj has some undefined transitions, ,_ 
1 1 Qi, otherwise, 

Now let 

B = (I, Q\ x Q' 2 , (90,1,90,2), (Fi x Q 2 ) U (Q\ x F 2 ), y), (4) 
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where for b G E, q\ G Q' { , i = 1 , 2, 



(8i(q[, b), 82^2, b)) if 8\{q\,b) and 8(q' 2 ,b) are both defined, 
(8i(q[,b),d) if 8i(q[,b) is defined and 8i{q'o,b) is undefined, 
(cf, 8i{q 2 ,b)) if is undefined and 82(q' 2 ,b) is defined, 

undefined, otherwise. 



(5) 



Note that above 8i(d,b), i= 1,2, is always undefined for any b G Z. 
We note that for iGl, 

= # tr (A 1 ,ft)-# tr (A 2 ^)+# tr (A 1 ^)+# tr (A 2 ^) + 

#tr(Al,6) • (|g 2 | -#tr(A 2 ,^)) +#tr(A 2 ,^) • (|fil | - #tr(Al, &)). 

Here 



(6) 



• # tr (Ai,&) -#tr(A2,^) is the number of transitions in ([5]) where both 8i(q' h b), i= 1,2, are defined, 

• # a (Ai,b) ■ {\Qj\ — # b (Aj,b)), {i,j} = {1,2}, is the number of transitions in ((5]> where Sj(q'[,b) is 
defined, 8j(q'j,b) is undefined and q'j G Qj, and, 

• #tr(A,-,&) is the number of transitions in ((5]> where 8i(q'[,b) is defined and q'j = d, = {1,2}. 

By choosing A; as the minimal DFA for I4, i = 1,2, and using Lemma Q] and Corollary [T] the right side 
of equation ((6]) gives the right side of inequality ((3]). Since B recognizes L\ UL2, tCb(L\ UL2) < # tr (B,b). 

We give a construction for the lower bound. Fix Let n\ > 1 be relatively prime, 1 < < 

/ = 1,2, and let c G L be a symbol distinct from b. Define 

Q = (^) {?0,i>9l,») • • • >#m-l,/}><ZO,i,{<70,i},5 ! ), 

where the transitions defined by 5,- are as follows: 

• 8i(q jti ,c) = q j+l j, j = 0, ... ,n t -2, 

• 8i(q ni -i :i ,c) = q Q j, 

• 8i{qjj,b) = q j:i , 7 = 0, 1. 




Figure 1: The transition diagram of the witness DFA C, of Lemma [2] 
The transition diagram of Q is shown in Figure Q] We note that # tT (Cj,b) = k\ and 

L(Q) = ((27*c)V>-*0*&*, / = 1,2. 

Clearly Q is minimal and hence sc(L(C,)) = rij, i = 1,2. In the following we denote L, = L(Q), i = 1,2, 
for short. Choose m\^m% G IN such that 



mi = (mod «i), mi = — 1 (mod 712), ^2 = (mod 722), m<i = —\ (mod n\). 
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Since n\ and n% are relatively prime, the numbers m,-, i = 1,2, exist. The intuitive idea is that we want 
that the string c m ' takes the automaton C\ to a state where the ^-transition is defined and the same string 
c'" 1 takes the automaton C2 to a state where the ^-transition is not defined. Recall that k 2 < n 2 and 
&z(<ln2-l,2,b) is undefined. Also, a similar property holds for c'" 2 with C\ and C2 interchanged. 

We define S = Si U S 2 U S3, where Sy = {& | < i < m ■ n 2 }, S 2 = {c m > be 1 | < i < m} and S 3 = 
{c m2 Z?c' I < i < n 2 }. We verify that all strings of S are pairwise in different equivalence classes of the 
right congruence ^l^Lt First consider c',c J G Si, < i < j < n\ -n 2 . Since n\ and n 2 are relatively 
prime, there exists k G {1,2} such that does not divide j — i. Denote by k' the element of {1,2} 
distinct from k. Select z G IN such that i + z = (mod n^) and j + z = 1 (mod ny). Now c'c z G and 
dc z G" L\ U L2. (7 + z ^ mod because does not divide j — i.) 

Consider c" 1[ bc l ,c m bd G S2, < i < j < n\. We note that strings of S 2 are not prefixes of any string 
inL2 and hence for z G IN such that m\ + i+z = (modni) we have c m[ bc l c z G Li and c mx bdc l $lL\ UL 2 . 
Similarly we see that any two elements of S3 are not in the same =1^12 -class. 

Next consider c' G Si and c m bd G S 2 , < i < n\ ■ n 2 , < j < n\. Choose z G IN such that i+z = 

(mod n 2 ) and m\ + j + z = 1 (mod Now c'c z G L2 and, since no string of S2 is a prefix of a string of 
L 2 , it follows that c" H bdc z $ L\ L)L 2 . Completely similarly it follows that a string of Si is not equivalent 
with any string of S3. 

As the last case consider ^bc' G S2 and c mi bd G S3, < i < n\, < j < n 2 . Choose z G IN such 
that m\ + i + z = (mod n\ ) and m 2 + j + z = 1 (mod 02)- Now c m ' bc'c z G Li and, since no string of S3 
is a prefix of a string in L\ , c m bdc z ^L\V>L 2 . 

Now we are ready to give a lower bound for the ^-transition complexity of L\ UL 2 . Let D be the 
minimal DFA for L\ UL 2 . By Lemma[T]we know that # tr (D,b) = tCb(L\ UL 2 ). 

For w G S, let g w be the state of D corresponding to w. We have verified that q Wl ^ q W2 when w\ ^w 2 . 
For c' G Si, < i < n\ • n 2 , the string c'b is a prefix of some string in L\ U L 2 if and only if 

i = j (mod n x ) for some < j < k x and some x G {1,2}. (7) 

The number of integers < i < n\ -n 2 that satisfy (O with value x G {1,2} is equal to k x ■ n y , where 
{x, y} = { 1 , 2}, and the number of integers < i < n\ -n 2 that satisfy © with both values x = 1 and x = 2 
is &i • &2- Thus, the number of states q w , w G Si for which the ^-transition is defined is k\n 2 + k 2 n\ — k\k 2 . 
For c" n bc' G S2, < i < n\, the string c mi bc l b is a prefix of some string of L\ UL 2 if and only if 

1 = j (mod n\), < j < k\. This means that the number of states q w , w G S2, for which the ^-transition 
is defined is k\. Similarly, S3 contains k 2 strings w such that the ^-transition is defined for the state q w . 

Putting the above together we have seen that 

# tr (D,b) > k\n 2 + k 2 ti\ — k\k 2 + k\ +k 2 . 

Recalling that sc(L,) = rc,-, tCfo(L ( ) = k{, i = 1,2, the right side of the above inequality becomes the right 
side of ([3]). Hence tc^(Li UL 2 ) is exactly k\n 2 + k 2 n\ —k\k 2 + k\ +k 2 . ■ 

Note that the above lower bound construction does not work with kf = i G {1,2}, because the 
proof relies on the property that some ^-transitions of C, are undefined. 

For given relatively prime integers n\ and n 2 , the construction used in Lemma [2] gives the maximum 
lower bound for tc^(Li UL 2 ) as a function of n, (= sc(L,)), i = 1,2, by choosing tc/,(L ; ) = rii — 1, i = 1,2. 
In this case also sc(L,) — tCfo(L,) = 1. 

On the other hand, by choosing k\=k 2 = \, Lemma [2] establishes that the ^-transition complexity 
of L\ UL 2 can be arbitrarily larger than the ^-transition complexity of the languages L\ and L 2 . These 
observations are stated in the below corollary. 
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Corollary 2 Suppose that the alphabet £ has at least two symbols and let b G £ be a fixed symbol ofL. 

(i) For any relatively prime integers n\ and n% there exist regular languages L,- with sc(L,) = 
tCfo(L,-) = 7i, — 1, i = 1,2, swcft f/to 



f ii) For any constants hi, £=1,2, and M > 1 fTzere en's? regular languages Li, £=1,2, smc/i tc^ (L,-) = ft;, 
£ = 1,2, and tc/,(Li UL 2 ) > M. 

For a given ft G £, Corollary |2](i) gives a lower bound for tc/j(Li UL2). The construction can be 
extended for more than one alphabet symbol as indicated in Corollary however, it cannot be extended 
to all the alphabet symbols. 

In the lower bound construction of the proof of Lemma|2j the language L, was defined by a DFA that 
has a c-cycle of length ni, and where exactly ki of the states had self-loops on symbol ft. We can get a 
simultaneous lower bound for the number of ^-transitions for any d G £ — {c,ft} by adding, in a similar 
way, self-loops on the symbol d. 

Corollary 3 Suppose that £ has at least two letters and fix c G £. Let n\ and 712 be relatively prime and 
for each ft G £ — {c} fix a number 1 < kj h < nu £ = 1,2. 
Then there exist regular languages L\ and L2 such that 



and the equality holds in (E])/or all ft G £ — {c}. 

Finally we note that the proof of Lemma [2] gives also the worst-case bound for the state complexity 
of union for incomplete DFAs. 

Corollary 4 If sc(L;) = ti,, £ = 1,2, the language L\ UL2 can be recognized by a DFA with at most 
7i 1 • «2 + n \ + n 2 states. For relatively prime numbers 711,712 > 2 the upper bound is tight. 

Proof. The upper bound follows from the construction used in the proof of Lemma |2] The upper bound 
is reached by the automata A\ and A2 used there for the lower bound construction (with any values 
1 <k { <n u i = 1,2). ■ 

3.2 Total number of transitions 

Here we give upper and lower bounds for the transition complexity of union of two regular languages. 

With respect to the total number of transitions for all input symbols, the lower bound construction of 
the proof of Lemma [2] maximizes tc(Li UL2) as a function of tc(L,), £ = 1,2, by choosing k\ = &2 = 1- 
In this case it can be verified that tc(Li UL2) = tc(Li) • tc(L2) + tc(Li) + tc(L2) — 2. However, when the 
alphabet has at least three symbols we can increase the lower bound by one, roughly as in Corollary |3]by 
observing that tc(L,) can be chosen to be zero as long as for each 1 < £ < 2 there exists ft G £ such that 
tc(L;) > 1. This is verified in the below lemma. 

Lemma 3 Let £ = {a,ft,c}. For any relatively prime numbers n\ and 712 there exist regular languages 
Li C £*, such that tc(L,) = ti, + 1, £ = 1,2, and 



tCh(Li UL2) = 71i7l2 +71i +712 — 3. 



SC(L/) =71/, tC b (Li) 



k iib , ft G£-{c}, £= 1,2, 



tc(LiUL 2 ) = tc(Li)-tc(L 2 ) + tc(Li) + tc(L 2 )- 1. 



(8) 
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Lemma|3]can be proved with a construction similar to the construction of the proof of Lemma[2]and 
witness languages L\ = a*{a*c ni )* and L2 = b*(b*c" 2 )*. Due to the page limitation, we omit the proof. 
Next we give an upper bound for transition complexity of union. In the following lemma let A\ and A 2 
be arbitrary DFAs and Ba { a 2 denotes the DFA constructed to recognize L(A\) UL(A2) as in the proof of 
LemmaEl (The definition of Ba 1 a 2 * s given in equation ((U).) 

Lemma 4 IfAi is connected, i = 1,2, then 

#tr(B Al ,A 2 ) < 2 - (# tr (A!) -# tr (A 2 ) +#tr(Ai) +# tr (A 2 )). 

Proof. We use induction on #tr(Ai) +# tr (A2). First consider the case where #tr(Ai) = # tr (A 2 ) = 0. In 
this case also Ba u a 2 has no transitions. 

Now assume that # tr (Ai) +# tr (A2) = m, and the claim holds when the total number of transitions is 
at most m—l. Without loss of generality, # tr (Ai) > 1, and let A\ be a connected DFA obtained from Ay 
by deleting one transition and possible states that became disconnected as a result. We can choose the 
transition to be deleted in a way that at most one state becomes disconnected. 

By the inductive hypothesis, 

#tr(B A [A 2 ) < 2 • (#tr(Ai) •# tr (A 2 ) + # tr (A\ ) + # tr (A 2 ) ) . (9) 

The DFA Ai is obtained by adding one transition t\ and at most one state q\ to A\. Let Q2 be the set of 
states of A2. The construction of Ba u a 2 is the same as the construction of B^i^, ex cept that 

(i) we add for t\ a new transition corresponding to each state of Q2 and a new transition corresponding 
to the dead state d in the second component, and, 

(ii) we add a new transition corresponding to q\ and each transition of A2. 
Thus, 

#tr(B Al A 2 ) < #UB A [m) + \Qz\ + 1 +#tr(A 2 ) < #tr(B A[A2 ) +2(# tr (A 2 ) + 1). 

The last inequality relies on £T|) and the fact that A2 is connected. Thus using (O and # tr (Aj ) = # tr (Ai ) — 1 
we get 

#*(Ba!m) ^ 2((# tr (A 1 ) - 1) •# tr (A 2 ) +# tr (A 1 ) - 1 +#, r (A 2 )) + 2(# tr (A 2 ) + 1). 

With arithmetic simplification this gives the claim for A\ and Ai. ■ 
From Lemma |3] and Lemma [4] we get now: 

Theorem 1 For all regular languages Li, i = 1,2, 

tc(Li UL 2 ) < 2 • (te(Li) • tc(L 2 ) + tc(Li) +tc(L 2 )). 

For any relatively prime numbers n\ and «2 there exist regular languages L,- over a three-letter alphabet, 
tc(L,) = Hi + 1, i = 1,2, such that 



tc(Li UL 2 ) = tc(Li) -tc(L 2 ) +tc(Li) +tc(L 2 ) - 1. 
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The upper and lower bound of Theorem Q] differ, roughly, by a multiplicative constant of two. We 
believe that the upper bound could be made lower (when tc(L,) > 2, i = 1,2), but do not have a proof for 
this in the general case. 

The constructions of Lemma [2] and Lemma [3] use languages L,, i = 1,2, such that for one particular 
alphabet symbol c € E, the minimal DFA for L,-, i= 1,2, has all c-transitions defined. It seems likely that 
worst-case examples need to be based on cycles of transitions on a particular alphabet symbol, in order 
to reach the maximal state complexity blow-up with as small number of transitions as possible. Below 
we establish that for this type of constructions the right side of ([8]> is also an upper bound for tc(Li UL2). 

Lemma 5 Let L\ and L 2 be regular languages over E. If there exists c £ E such that in the minimal DFA 
for Lj, i — 1,2, all c-transitions are defined, then 

tc(LiUL2) <tc(L 1 )-tc(L 2 ) + tc(L 1 ) + tc(L 2 )-l. (10) 

The idea of the proof of LemmaUJis similar to that of the proof of Lemma[4] The crucial difference is 
that we have one symbol for which all transitions are defined and the inductive argument is with respect 
to the number of the remaining transitions. Thus, in the inductive step when replacing A\ with a DFA 
Aj with one fewer transition, we know that A\ is connected and the inductive step does not need to add 
transitions corresponding to a state that would be added to A\ . 

Lemma [5] establishes that the bound given by Lemma [3] cannot be exceeded by any construction that 
is based on automata that both have a complete cycle defined on the same alphabet symbol. Usually it 
is easier to establish upper bounds for descriptional complexity measures, and finding matching lower 
bounds is a relatively harder question. In the case of transition complexity of union we have a lower 
bound and only indirect evidence, via Lemma [51 that this lower bound cannot be exceeded. 

Conjecture 1 For any regular languages L\ and L 2 where tc(L,) > 2, i = 1,2, 

tc(Li UL 2 ) < tc(Li) -tc(L 2 ) +te(Li) +tc(L 2 ). 

Note that the conjecture does not hold for small values of tc(L,-), i = 1,2. For example, tc({£}) = 0, 
tc(a*b m - 1 ) = m, buttc(a*b m - 1 +e)=m + 2. 

3.3 Transition complexity of union of unary languages 

For languages over a unary alphabet, the transition complexity of union of incomplete DFAs turns out to 
coincide with the known bound for state complexity of union of complete DFAs. However, the proof is 
slightly different. 

Recall that a DFA with a unary input alphabet always has a "tail" possibly followed by a "loop" (21 
fTTTj . Note that an incomplete DFA recognizing a finite language does not need to have a loop. 

Theorem 2 Let Li,L 2 be unary languages over an alphabet {b}. Iftc(Lj) >2, i= 1,2, then 

tc/XiULz) <tc(Li)-tc(L 2 ). (11) 

For any relatively prime n\ > 3, « 2 > 2, there exist regular languages Lj C {b}*, tc(L,) = nu i = 1,2, 
such that d771) is an equality. 

This theorem can be proved by separately considering the cases where L\ and L 2 are finite or infinite. 
The detailed proof is omitted. Note that the upper bound of Theorem |2] does not hold when n\ < 3 or 
n 2 < 2. For example, tc(b) = 1, tc((Z?")*) = n and tc(fcU (&")*) = n + 1 when n > 2. 
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4 Intersection and complementation 

As can, perhaps, be expected the worst-case transition complexity bounds for intersection are the same 
as the corresponding state complexity results based on complete DFAs. When dealing with intersection, 
worst-case examples can be constructed using a unary alphabet and a complete DFA. On the other hand, 
state complexity of complementation of complete DFAs is the identity function whereas the bound for 
transition complexity of complementation is significantly different. 

Proposition 1 For any regular languages Li, i= 1,2, 

tc(Li HL 2 ) < tc(Li) -tc(L 2 ). (12) 

Always when n\ and n 2 are relatively prime there exist regular languages L,-, tc(L,) = m, i = 1,2, such 
that equality holds in ( 1721 ). 



Proof. Let A; = (E, Qi,qoj,Fi, 5,-) be a DFA recognizing L h i = 1,2. We define 

B = (L,Q 1 xQ 2 ,(qo,i,qo,2),F 1 xF 2 ,y), (13) 
where for b € E, g,- G Qi, i = 1,2, 



(Si(qi,b),82(q2,b)) if 8\(qi,b) and &i{q 2 ^b) are both defined, 
undefined, otherwise. 



Clearly B recognizes L\ C\L 2 and # tr (B) = #tr(Ai) -#tr(A2). 

The lower bound follows from the observation that if n\ and n 2 are relatively prime and A,- is the 
minimal DFA for (b m )*, i= 1,2, then the DFA B in (fT3l is also minimal and # tr (B) = n\n 2 . ■ 

The proof of Proposition Q] gives for b € E the same tight bound for the number of ^-transitions 
needed to recognize the intersection of given languages. 

Corollary 5 For any regular languages Li over E, i = 1,2, and b € E, 

tc h (L l nL2)<tc b (L i )-tc b (L 2 ). (14) 

For relatively prime integers n\ and n 2 there exist regular languages Li with tofo(L, ) = nu i = 1,2, such 
that equality holds in M4\i . 

To conclude this section we consider complementation. If A is an n-state DFA, a DFA to recognize 
the complement of L(A) needs at most n + l states. The worst-case bound for transition complexity of 
complementation is significantly different. 

Proposition 2 Let L be a regular language over an alphabet E. The transition complexity of the com- 
plement ofL is upper bounded by 

tc(L c ) < |E|-(tc(L) + 2). 

The bound is tight, that is, for any n>\ there exists a regular language L with tc(L) = n such that in the 
above inequality the equality holds. 
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Proof. Let A = (£, Q, qo,F, 8) be a DFA for L. The complement of L is recognized by the DFA 

B=(L,QU{d},q ,(Q-F)U{d},y), 

where for b G L 

, J 8(p,b) if p£Q and S(p,b) is defined, 

y ^ P > > ~ \ d if is undefined. 

Note that when p = d, 8(p,b) is undefined for all b G £. 

The DFA B has (|<2| + 1) • |E| transitions. If A is minimal, A has at least \Q\ — 1 transitions, and this 
gives the upper bound. 

We establish the lower bound. Choose Jsl and for n > 1 define L„ = {b n }. Now tc(L„) = n. Denote 
5 = {s,b, . . . ,b" +l }. All strings of S are pairwise inequivalent with respect to the right congruence =ic, 
and 

(Vjc G 5)(Vc G E)(3y G I*) ;ccy G L*. 

This means that the minimal DFA for L c n has (at least) n + 2 states for which all transitions are defined. 
Thus, tc(L£) > |Z|-(n + 2). ■ 

From the construction of the proof of Proposition[2]we see that if £ contains at least two symbols then 
for a G E and any M > 1 there exists a regular language L over £ such that tc a (L) = and tc a (L c ) > M. 
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